37 research outputs found

    Moltemplate: A Tool for Coarse-Grained Modeling of Complex Biological Matter and Soft Condensed Matter Physics

    Get PDF
    Coarse-grained models have long been considered indispensable tools in the investigation of biomolecular dynamics and assembly. However, the process of simulating such models is arduous because unconventional force fields and particle attributes are often needed, and some systems are not in thermal equilibrium. Although modern molecular dynamics programs are highly adaptable, software designed for preparing all-atom simulations typically makes restrictive assumptions about the nature of the particles and the forces acting on them. Consequently, the use of coarse-grained models has remained challenging. Moltemplate is a file format for storing coarse-grained molecular models and the forces that act on them, as well as a program that converts moltemplate files into input files for LAMMPS, a popular molecular dynamics engine. Moltemplate has broad scope and an emphasis on generality. It accommodates new kinds of forces as they are developed for LAMMPS, making moltemplate a popular tool with thousands of users in computational chemistry, materials science, and structural biology. To demonstrate its wide functionality, we provide examples of using moltemplate to prepare simulations of fluids using many-body forces, coarse-grained organic semiconductors, and the motor-driven supercoiling and condensation of an entire bacterial chromosome

    How many human proteoforms are there?

    Get PDF
    Despite decades of accumulated knowledge about proteins and their post-translational modifications (PTMs), numerous questions remain regarding their molecular composition and biological function. One of the most fundamental queries is the extent to which the combinations of DNA-, RNA- and PTM-level variations explode the complexity of the human proteome. Here, we outline what we know from current databases and measurement strategies including mass spectrometry-based proteomics. In doing so, we examine prevailing notions about the number of modifications displayed on human proteins and how they combine to generate the protein diversity underlying health and disease. We frame central issues regarding determination of protein-level variation and PTMs, including some paradoxes present in the field today. We use this framework to assess existing data and to ask the question, "How many distinct primary structures of proteins (proteoforms) are created from the 20,300 human genes?" We also explore prospects for improving measurements to better regularize protein-level biology and efficiently associate PTMs to function and phenotype

    The genomic landscape of balanced cytogenetic abnormalities associated with human congenital anomalies

    Get PDF
    Despite the clinical significance of balanced chromosomal abnormalities (BCAs), their characterization has largely been restricted to cytogenetic resolution. We explored the landscape of BCAs at nucleotide resolution in 273 subjects with a spectrum of congenital anomalies. Whole-genome sequencing revised 93% of karyotypes and demonstrated complexity that was cryptic to karyotyping in 21% of BCAs, highlighting the limitations of conventional cytogenetic approaches. At least 33.9% of BCAs resulted in gene disruption that likely contributed to the developmental phenotype, 5.2% were associated with pathogenic genomic imbalances, and 7.3% disrupted topologically associated domains (TADs) encompassing known syndromic loci. Remarkably, BCA breakpoints in eight subjects altered a single TAD encompassing MEF2C, a known driver of 5q14.3 microdeletion syndrome, resulting in decreased MEF2C expression. We propose that sequence-level resolution dramatically improves prediction of clinical outcomes for balanced rearrangements and provides insight into new pathogenic mechanisms, such as altered regulation due to changes in chromosome topology

    Finishing the euchromatic sequence of the human genome

    Get PDF
    The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead

    Open Location

    No full text

    Web accessibility on campus

    No full text

    Assignment: Worklife

    No full text

    The Social Design of Worklife with Computers and Networks: An Open Natural Systems Perspective

    No full text
    If you read a broad sample of books or articles about computerization and changing work, you will find that groups of authors seem to be writing about completely different universes. Some focus on older technologies, or on current technologies; others explore the possibilities afforded by emerging technologies. A few writers will focus on those professionals who have significan
    corecore